NetSHa: In-Network Acceleration of LSH-Based Distributed Search

Abstract

Locality Sensitive Hashing (LSH) is widely adopted to index similar data in high-dimensional space for approximate nearest neighbor search. Demanding applications (e.g. web search) mean that LSH must exhibit low response times and high throughput. To achieve this, they tend to load balance between multiple machines. However, as the scale of concurrent queries and the volume of data grow, large numbers of messages are required. Hence, the network is a key bottleneck. To address this gap, we propose NetSHa, which exploits the computational capacity of programmable switches. Specifically, we introduce a heuristic sort-reduce approach to drop potentially poor candidate answers while preserving the search quality. Then, NetSHa aggregates good candidate answers from different messages when transmitting them. Through this, it reduces the communication cost. Furthermore, we introduce a best-effort replacement mechanism to improve its concurrency. We implement NetSHa on a Barefoot Tofino switch and evaluate it using 7 real-world datasets. The experimental results show that NetSHa reduces the packet volume by $4\sim 10\times$ and improves the search efficiency by at least 3× in comparison with typical LSH-based distributed search frameworks.
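
The sort-reduce idea in the abstract, dropping poor candidates and merging the good ones from several messages, can be illustrated with a small host-side sketch. This is not the NetSHa switch implementation, and the abstract does not specify its heuristic. The `(distance, item_id)` candidate layout, the function name `sort_reduce`, and the "keep the k closest" rule are all assumptions made for illustration.

```python
import heapq
from typing import Dict, Iterable, List, Tuple

# Hypothetical candidate layout: (distance to the query, item id).
Candidate = Tuple[float, int]


def sort_reduce(messages: Iterable[List[Candidate]], k: int) -> List[Candidate]:
    """Merge candidate lists from several messages and keep the k closest.

    An item id that appears more than once is kept a single time,
    with the smallest distance seen for it.
    """
    best: Dict[int, float] = {}
    for message in messages:
        for distance, item_id in message:
            if item_id not in best or distance < best[item_id]:
                best[item_id] = distance
    # nsmallest returns the survivors sorted by ascending (distance, id).
    return heapq.nsmallest(k, ((d, i) for i, d in best.items()))


if __name__ == "__main__":
    worker_a = [(0.9, 7), (0.2, 3), (1.5, 11)]
    worker_b = [(0.4, 5), (0.25, 3), (2.0, 8)]
    print(sort_reduce([worker_a, worker_b], k=3))
    # → [(0.2, 3), (0.4, 5), (0.9, 7)]
```

In this sketch six candidates from two messages shrink to three in a single outgoing list, which is the kind of traffic reduction the abstract attributes to in-network aggregation. A real programmable switch would have to do this under tight per-packet memory and pipeline constraints.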

Similar resources

LSH At Large - Distributed KNN Search in High Dimensions

We consider K-Nearest Neighbor search for high dimensional data in large-scale structured Peer-to-Peer networks. We present an efficient mapping scheme based on p-stable Locality Sensitive Hashing to assign hash buckets to peers in a Chord-style overlay network. To minimize network traffic, we process queries in an incremental top-K fashion leveraging on a locality preserving mapping to the pee...

Analysis of power in the network society

Scholars and experts in the social sciences believe that a new stage in the history of human societies has begun. The features of this new society include phenomena such as the global informational economy, the variable geometry of networks, the culture of real virtuality, the astonishing development of digital technologies, online services, and the compression of time and space. On the other hand, power, as the central subject of political science, holds an important place in human relations; power and the reproduction...

A harmony search-based approach for real-time volt & var control in distribution network by considering distributed generations units

In the past decade, the development of telecommunications infrastructure has led to rapid exchange of data between distribution network components and the control center in many developed countries. These changes, together with the numerous benefits of Distributed Generators (DGs), have given distribution companies more motivation than ever to utilize these kinds of generators. ...

Radial Basis Neural Network Based Islanding Detection in Distributed Generation

This article presents a Radial Basis Neural Network (RBNN) based islanding detection technique. Islanding detection and prevention is a mandatory requirement for grid-connected distributed generation (DG) systems. Several methods based on passive and active detection schemes have been proposed. While passive schemes have a large non-detection zone (NDZ), concern has been raised on active method ...

lsh, Nearest neighbor search in high dimensions

Calculating distance pairs is $O(n^2)$ in memory and time, and finding the nearest neighbor is $O(n)$ in time. Tree indexing techniques like kd-tree [2] were developed to cope with large n; however, their performance quickly breaks down for p > 3 [3]. Locality sensitive hashing (LSH) [3] is a technique for generating hash numbers from high dimensional data, such that nearby points have identical hashe...

Journal

Journal title: IEEE Transactions on Parallel and Distributed Systems

Year: 2022

ISSN: 1045-9219, 1558-2183, 2161-9883

DOI: https://doi.org/10.1109/tpds.2021.3135842